PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Carubv10016562m
Common Name CARUB_v10016562mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family MYB
Protein Properties Length: 1656aa    MW: 181932 Da    PI: 6.8824
Description MYB family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Carubv10016562m  genome  JGI     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding  33.4   1e-10    870    911   3          46
                      SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
  Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                      +WT eE e+++   + +G++ +k+Ia+++   +t  +c+++++k
  Carubv10016562m 870 PWTSEEKEIFLSMLAIHGKD-FKKIASYLT-EKTTADCIDYYYK 911
                      8*****************99.********9.9**********98 PP

2    Myb_DNA-binding  28.7   3.1e-09  1085   1126  4          47
                       S-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHH CS
  Myb_DNA-binding    4 WTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqky 47  
                       WT +E   +++++ ++G++ +++I+r++g +R++ qc+ ++ k+
  Carubv10016562m 1085 WTDDERSAFIQGFSLFGKN-FASISRYVG-TRSPDQCRVFFSKV 1126
                       *****************99.*********.********998776 PP
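The reported coordinates can be cross-checked against the alignment rows: coordinates are 1-based and inclusive, so the number of non-gap columns in each row should equal End - Start + 1. Below is a minimal Python sketch for domain 1. The helper name `ungapped_length` is hypothetical and not part of PlantTFDB or HMMER, and the two strings are copied from the alignment above.

```python
def ungapped_length(aligned: str, gap_chars: str = "-.") -> int:
    """Count the alignment columns that hold a residue rather than a gap."""
    return sum(1 for ch in aligned if ch not in gap_chars)


# Domain 1 rows, copied from the alignment block in this record.
hmm_line = "rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk"
query_line = "PWTSEEKEIFLSMLAIHGKD-FKKIASYLT-EKTTADCIDYYYK"

hmm_start, hmm_end = 3, 46      # HMM Start / HMM End
seq_start, seq_end = 870, 911   # Start / End on Carubv10016562m

# An inclusive 1-based span covers end - start + 1 positions.
assert ungapped_length(hmm_line) == hmm_end - hmm_start + 1
assert ungapped_length(query_line) == seq_end - seq_start + 1
```

The same check should work for domain 2 by substituting its rows and coordinates.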

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SuperFamily      SSF46689           2.17E-14  853    914   IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60   9.0E-7    862    913   IPR009057    Homeodomain-like
PROSITE profile  PS51293            14.857    866    917   IPR017884    SANT domain
SMART            SM00717            4.2E-10   867    915   IPR001005    SANT/Myb domain
Pfam             PF00249            1.5E-8    869    911   IPR001005    SANT/Myb domain
CDD              cd00167            1.82E-8   870    912   No hit       No description
PROSITE profile  PS51293            12.761    1080   1131  IPR017884    SANT domain
SMART            SM00717            3.5E-8    1081   1129  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60   8.6E-5    1084   1125  IPR009057    Homeodomain-like
SuperFamily      SSF46689           1.4E-9    1084   1131  IPR009057    Homeodomain-like
Pfam             PF00249            9.0E-8    1085   1125  IPR001005    SANT/Myb domain
CDD              cd00167            4.39E-7   1085   1126  No hit       No description
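The twelve hits above come from six databases but describe overlapping stretches of the protein. One way to summarize them is to merge overlapping spans into consensus regions. The following is a minimal Python sketch of standard interval merging, not a PlantTFDB method. The name `merge_spans` is hypothetical, and the (Start, End) pairs are copied from the table.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # (start, end), 1-based inclusive


def merge_spans(spans: List[Span]) -> List[Span]:
    """Merge overlapping spans into the smallest set of covering regions."""
    merged: List[Span] = []
    for start, end in sorted(spans):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged


# (Start, End) of every hit listed under Protein Features.
hits = [
    (853, 914), (862, 913), (866, 917), (867, 915), (869, 911), (870, 912),
    (1080, 1131), (1081, 1129), (1084, 1125), (1084, 1131), (1085, 1125),
    (1085, 1126),
]

regions = merge_spans(hits)
print(regions)  # [(853, 917), (1080, 1131)]
```

The two merged regions bracket the two Myb_DNA-binding signature domains (870-911 and 1085-1126) reported earlier in the record.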
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1656 aa
MPQDHASWDR KELLRQRKHD RPEQSFDSPF RWRDSPTTPS SHHVPREFSR WGGSGDFRRP  60
SCHGKQGGRH QFVEEGSHGY TSSRSSARIF ENDYYRPSAS RGDWRYTRNC RDDRASVSQK  120
EWKCNTWDMS NGSSRSFERP FGIRNGRRSV DERPLHASDT HTTMVNSLDP TNSAHQPDTE  180
ICTPVRSLKF KNEQKFSDQR LSLPSDPHSD CVRLFERPSS ENNYGNKICS PAKQCNDLMY  240
GRRIANDNSL DPPILNADLE GTWEQLHMKD PQEDNKLHGI TDLDDARKCA KESSLGAIGK  300
LPLWNSSGSF ASQSSGFSHS SSLKSVGAVD STDRKTEVLP KIATVAQSSS GDATPCATTT  360
HLFEEMSSRK KQRLGWGEGL AKYEKKKVDV NTNEDGTTLL ENGLDEQHSL NKNIADKSPT  420
AAILPDYGSP TTPSSVACSS SPGFADKSSA KAAIAASDVS NMCRSPSPVS SIHLERFPVN  480
IDELDNISME RFGCLLNELL CTDEPGTGDS SSVQLTSMNR LLAWKSEILK AVEMTESEID  540
LLENKHRTLK LEGGRHCHVG SSSYFCEGDA DVPKEQEASC ILGPKAAATP VAEALVRSPV  600
HQSSLAKVSV DVCEDNTQEV KSLSQSFATV ESNEDILPKL SMKAVTSSKE ISTPAFVNQE  660
TVELSFADDS MASNEDLLCA KLLSSNKKYA CESSGVFNEL LPRDCSFDES RYFGICQMQF  720
DSHVKEKLAD RVELLRAREK ILLLQFKAFQ LSWKKDLDQL ALTKYQSKSS RKSDLYPNAK  780
NGGYLKLPQP VRLRFSSSAP RRDSVVPTTE LVSYMEKLLL GTNLKPYRDI LRMPAMMLDE  840
RERVMSRFIS SNGLVEDPCD VEKERTMINP WTSEEKEIFL SMLAIHGKDF KKIASYLTEK  900
TTADCIDYYY KNHKSDCFGK IKKQRAYGKE GKHTYMLAPR KKWKREMGAA SLDILGAVSI  960
IAANAGKVAS TRQISSKRIT LRGCSSSNSL QHDGNNSEGC SYSFDFPRKR TVGADVLAVG  1020
PLSSEQINSC LRTSVSSRER CMDHLKFNPV VKKPRISHTL HNENSNEEDD SCSEESCGET  1080
GPIHWTDDER SAFIQGFSLF GKNFASISRY VGTRSPDQCR VFFSKVRKCL GLEFIQSGSG  1140
NLSTSVSVDN GNEGGGSDLE DPCPMESNSG ICNNGVCAKM DINSPTSPFN MNQDGANHSG  1200
SANVKADLSR SEQENGLTYI HLKDGRNLVS NAYIKGDLPG LVSESCRDLV DINTVENQSQ  1260
AAGKSKSSDL LSMEIDEGVL TSVAVSSEPL YCGLSVLSNV IVETPTESSQ MGSGDQGAAT  1320
MLKLNSKNQD GVMQAANRTK NPGLDPESAP SGFKYPECLH HVPIEVCTEN PIGVSVPRGN  1380
PNCHTEAKSG NSLVGQAVET HGLGWQFSKE NLELNGRLQV IGHVNPEQNG QLNSINAESC  1440
QIPQRSVTQD PSRISRSKSD LIVKTQRTGE GFSLNKCTSS APNSLTVSHK EGRSGHIRSH  1500
SFSLSDTERL DKNGDVKLFG TVLTADENGI KQKHNPGGSV RSSSTLSRDH DTRHHYINQQ  1560
HLQNVPITSY GFWDGNRIQT GLTSLPESAK LLASCPEAFS THLKQQVGSN KEIRRDVNGG  1620
GILSFGKHNE DRAEASSAKD GGNIGGVNGV AEAAT*
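The listing above groups residues in blocks of ten, ends each full line with a running position counter, and marks the end of the protein with `*`. To recover a domain by its coordinates, the listing first has to be flattened. This is a minimal Python sketch, with hypothetical helper names (`parse_sequence_block`, `subsequence`). For brevity it uses only the two lines covering residues 841-960, copied from the listing, so an offset of 840 is passed.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Flatten a blocked listing: drop spaces, position counters and '*'."""
    return re.sub(r"[^A-Za-z]", "", text).upper()


def subsequence(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    offset is the number of residues that precede seq in the full protein.
    """
    return seq[start - 1 - offset : end - offset]


# Lines ending at positions 900 and 960, i.e. residues 841-960.
excerpt = """
RERVMSRFIS SNGLVEDPCD VEKERTMINP WTSEEKEIFL SMLAIHGKDF KKIASYLTEK  900
TTADCIDYYY KNHKSDCFGK IKKQRAYGKE GKHTYMLAPR KKWKREMGAA SLDILGAVSI  960
"""

seq = parse_sequence_block(excerpt)
domain1 = subsequence(seq, 870, 911, offset=840)
print(domain1)  # PWTSEEKEIFLSMLAIHGKDFKKIASYLTEKTTADCIDYYYK
```

The printed string matches the gap-free query row of the first Myb_DNA-binding alignment. With the full listing parsed, the offset can be left at 0.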
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4a69_C  2e-15   828          918        4          93       NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D  2e-15   828          918        4          93       NUCLEAR RECEPTOR COREPRESSOR 2
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID             E-value  Description
Refseq  XP_006290487.1     0.0      hypothetical protein CARUB_v10016562mg
TrEMBL  R0HEB9             0.0      R0HEB9_9BRAS; Uncharacterized protein
STRING  scaffold_502292.1  0.0      (Arabidopsis lyrata)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM5260              27           44
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G52250.1  0.0      MYB family protein